HST.723 Neural Coding and Perception of Sound, Theme 4: Pitch and Temporal Coding
Abstract
Pitch is often described as a primary parameter in music, a basic concept upon which other musical categories, such as pitch intervals and harmony, can be built (Snyder, 2000). While ANSI defines pitch primarily in a unidimensional space, Shepard (1982) postulated a multidimensional spatial model in which the Euclidean distances between musical tones (in the Western tonal tradition) reflect their perceived relations. This complex percept is important to humans not only for appreciating melody and harmony in music (Dowling, 1999), but also for conveying prosody in speech (Winter et al., 2001), for segregating multiple speakers talking simultaneously (Darwin, 1997), and, in tonal languages such as Cantonese, for carrying lexical meaning (Fok, 1974). However, our understanding of how pitch is coded remains at a basic level. There are two competing theories of how frequency, the physical quantity on which the pitch percept depends, is coded. The first is the “place” theory, based on the spectral analysis performed in the inner ear and the fact that different frequencies excite different places along the basilar membrane (Moore, 2003). The other is the temporal coding theory, in which frequency is related to the time pattern of the neural impulses evoked by the stimulus. Several mechanisms of complex pitch perception have also been postulated in the last few decades, ranging from Schouten’s temporal theory and the pattern recognition theories proposed by investigators such as Goldstein and Terhardt to variations on Licklider’s autocorrelation model, which incorporate both place and timing information in their pitch extraction algorithms. In this theme report, a sample of the vast and diverse research in pitch perception is presented. Literature reporting elements that influence pitch judgment, such as spectral weighting and auditory grouping mechanisms, is reviewed.
Neurophysiological studies are surveyed to ascertain whether there are anatomical correlates that support the aforementioned pitch extraction models. Finally, a brief sample of studies in pitch perception at the cortical level is summarized.
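To make the temporal, autocorrelation-style idea mentioned in the abstract concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions, not the model of Licklider or of any study cited here. It autocorrelates the raw waveform rather than simulated auditory-nerve activity. The stimulus (harmonics at 600, 800, and 1000 Hz with the 200 Hz fundamental absent), the sample rate, the search range, and the helper names `complex_tone` and `autocorr_pitch` are all hypothetical choices made for this example.

```python
import math


def complex_tone(freqs_hz, sample_rate, duration_s):
    """Sum of equal-amplitude sinusoids (hypothetical helper for illustration)."""
    n = int(sample_rate * duration_s)
    return [
        sum(math.sin(2 * math.pi * f * t / sample_rate) for f in freqs_hz)
        for t in range(n)
    ]


def autocorr_pitch(signal, sample_rate, f_min=50.0, f_max=500.0):
    """Return the frequency (Hz) whose period maximizes the autocorrelation.

    Assumption: the search is limited to lags corresponding to f_min..f_max.
    """
    lag_min = int(sample_rate / f_max)
    lag_max = int(sample_rate / f_min)
    best_lag, best_value = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation: fewer terms are summed at longer lags,
        # which favors the shortest period among equally good candidates.
        value = sum(signal[i] * signal[i + lag] for i in range(len(signal) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    return sample_rate / best_lag


SAMPLE_RATE = 16000  # Hz, assumed for this sketch
# Harmonics 3, 4, and 5 of 200 Hz; the fundamental itself is not present.
tone = complex_tone([600.0, 800.0, 1000.0], SAMPLE_RATE, 0.1)
estimate = autocorr_pitch(tone, SAMPLE_RATE)
print(f"estimated pitch: {estimate:.1f} Hz")
```

Although the lowest component in this stimulus is 600 Hz, the waveform repeats every 5 ms, so the autocorrelation peak falls at the period of the missing 200 Hz fundamental. This is the kind of timing cue that a purely place-based readout of the lowest excited frequency would not capture.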
Similar resources
Harvard-MIT Division of Health Sciences and Technology, HST.722J: Brain Mechanisms for Hearing and Speech. Absolute Pitch
Pitch is a fundamental attribute of sound, which has led to extensive research on pitch processing, categorization, and memory with the goal of elucidating the complex workings of the auditory system. The phenomenon of absolute pitch (AP), the ability to identify or produce a specified pitch without external reference, provides a unique opportunity to study the perception and neural coding of p...
Learning Pitch with STDP: A Computational Model of Place and Temporal Pitch Perception Using Spiking Neural Networks
Pitch perception is important for understanding speech prosody, music perception, recognizing tones in tonal languages, and perceiving speech in noisy environments. The two principal pitch perception theories consider the place of maximum neural excitation along the auditory nerve and the temporal pattern of the auditory neurons' action potentials (spikes) as pitch cues. This paper describes a ...
Neural coding of periodicity in marmoset auditory cortex.
Pitch, our perception of how high or low a sound is on a musical scale, crucially depends on a sound's periodicity. If an acoustic signal is temporally jittered so that it becomes aperiodic, the pitch will no longer be perceivable even though other acoustical features that normally covary with pitch are unchanged. Previous electrophysiological studies investigating pitch have typically used onl...
Proportional spike-timing precision and firing reliability underlie efficient temporal processing of periodicity and envelope shape cues.
Temporal sound cues are essential for sound recognition, pitch, rhythm, and timbre perception, yet how auditory neurons encode such cues is the subject of ongoing debate. Rate coding theories propose that temporal sound features are represented by rate tuned modulation filters. However, overwhelming evidence also suggests that precise spike timing is an essential attribute of the neural code. Here ...
Hierarchical spike coding of sound
Natural sounds exhibit complex statistical regularities at multiple scales. Acoustic events underlying speech, for example, are characterized by precise temporal and frequency relationships, but they can also vary substantially according to the pitch, duration, and other high-level properties of speech production. Learning this structure from data while capturing the inherent variability is an ...